BIOTECHNOLOGY gefl ro ^ 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: /oj 6 S3, (o$l 

Source: / £-'t J ja " 

Oate Processed by*STIC; <? /Xyo^ ~ 

' THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 
PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 703008-42 1 2; FAX: 703008-4221 
Effective 12/13/03 : TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.1 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 

http://\vw\v.uspto.govAvcb/ofTices/pac/chcckcr/chkr41notc.lUni 



Applicants submitting genetic scquciKe information electronically on diskette or CD-Rom should be aware that there is 
a possibility that the disk/CD-Rom may have been affected by treatment ^ven to all incoming mail. 
Please consider using alternate methods of submission for Uffffisk/CD-Rom or replacement disk/CD-Rom. * -A* 

Any reply including a sequence listing in electronic fornrshoold NOfTc sent to the 2023 1 zip code address for the 
United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/do\vnloads/documcnts.Iitni> , EFS Submission 
User Manual -cPAVE) 

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 223 13-1450 

3. Hand Carry directly to (EFFECTIVE 12/01/03): 

U.S. Patent and Trademark Office, Box Sequence, Customer. Window, Lobby, Room 1B03, Crystal Plaza Two. 
201 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, oe other delivery service to: U.S. Patent and Trademark Office, 

Box Sequence, Room 4B03-Mailroom, Crystal Plaza Two, 201 1 South Clark Pftcc, Arlington, VA 22202 

* 

Revised 10/OS/03 



Raw Sequence Listing Error Summary 



u NUMBER: _ 



ERROR DETECTED SUGGESTED CORRECTION SERIAL I 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

1 Wrapped Nuclcics The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 

Wrapped Aminos was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 

2 Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 

3 _Misaligned Amino The numbering under each 5 th amino acid is misaligned. Do not use tab codes between numbers; 

Numbering use space characters, instead. 



10 



12 



Non-ASCII 



Variable Length 



Patentln 2.0 
" "bug" 



The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text. 

Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused.the <220>-<223> section to be missing from amino acid 

sequences(s) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 



_Skipped Sequences 
(OLD RULES) 



Skipped Sequences 
(NEW RULES) 



Use of n's or Xaa's 
(NEW RULES) 



Jnvalid <2I3> 
Response 

Useof<220> 



Patentln 2.0 
"bug" 



Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence. 

<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's arc present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of. Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <2I3> response is Unknown or 
is Artificial Sequence 



Seauencc(s) 



missing 



jse of <220> to <223> is MANDATORY if <213> "Organism" response is 
"Unknown." Please explain source of genetic material in <220> to <223> section. 
(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) 



(Sec. 1.823 of Sequence Rules) 



Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 Misuse of n/Xaa "n" can only represent a single nucleotide ; "Xaa" can only represent a single amino acid 



AMC - Biotechnology Systems Branch - 09/09/2003 
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IFWO 



RAW SEQUENCE LISTING DATE: 03/29/2004 

PATENT APPLICATION: US/10/653 , 681 TIME: 13:18:10 

Input Set : A:\U 014798-3.ST25.txt 
Output Set: N:\CRF4\03292004\J653681.raw 

3 <110> APPLICANT: Dai, Ken-Shwo 

5 <120> TITLE OF INVENTION: HUMAN ARL-RELATED GENE VARIANTS ASSOCIATED WITH CANCER 

7 <130> FILE REFERENCE: U 014798-3 

9 <140> CURRENT APPLICATION NUMBER: 10/653,681 

10 <141> CURRENT FILING DATE: 2003-09-02 

12 <160> NUMBER OF SEQ ID NOS : 4 

14 <170> SOFTWARE: Patentln version 3.2 

16 <210> SEQ ID NO: 1 

-1"7 ^- O 1 1 -v. T XTM/"' T> U . 



DostffaF Comply 
"*ed Diskette Needea 



18 <212> TYPE:' DNA 




19 <213> ORGANISM^ 

21 <220> FEATURE: 

22 <223> OTHER INFORMATION! 
25 <220> FEATURE: 



26 <221> NAME/KEY 

27 <222> LOCATION 
29 <400> SEQUENCE 




UNKNOWN 



CDS 

(70) ... (333) 
1 

30 caaaaacagc aacagaaagc aggacgtgag acttctacct gctcactcag aatcatttct 

32 gcaccaacc atg gcc acg ttt gtg gag etc agt acc aaa gec aag atg ccc 

33 Met Ala Thr Phe Val Glu Leu Ser Thr Lys Ala Lys Met Pro 

34 1 5 10 

36 att gtg ggc ctg ggc act tgg aag tct cct etc ggc aaa gtg aaa gaa 

37 He Val Gly Leu Gly Thr Trp Lys Ser Pro Leu Gly Lys Val Lys Glu 



38 15 



20 



40 gca gtg aag gtg gcc att gat 



25 30 
gca gga tat egg cac att gac tgt gcc 



41 Ala Val Lys Val Ala He Asp Ala Gly Tyr Arg His He Asp Cys Ala 



42 
44 



35 



40 



45 



tat gtc tat cag aat gaa cat gaa gtg ggg gaa gcc ate caa gag aag 
45 Tyr Val Tyr Gin Asn Glu His Glu Val Gly Glu Ala He Gin Glu Lys 



46 
48 
49 
50 



50 



55 



60 



ate caa gag aag get gtg aag egg gag gac ctg ttc ate gtc age aag 

He Gin Glu Lys Ala Val Lys Arg Glu Asp Leu Phe He Val Ser Lys 
65 70 75 

52 ttg tgg ccc act tec aga teg aga age tct tgaacaaacc tggactgaaa 

53 Leu Trp Pro Thr Ser Arg Ser Arg Ser Ser 

54 80 85 

56 tataaaccag tgactaacca ggttgagtgt cacccatacc teaegcagga gaaactgatc 

58 cagtactgcc actccaaggg catcaccgtt acggcctaca gccccctggg ctctccggat 

60 agaccttggg ccaagccaga agacccttcc ctgctggagg atcccaagat taaggagatt 

62 getgeaaage acaaaaaaac cgcagcccag gttctgatcc gtttccatat ccagaggaat 

64 gtgattgtca tccccaagtc tgtgacacca geaegcattg ttgagaacat tcaggtcttt 

66 gactttaaat tgagtgatga ggagatggca accatactca gcttcaacag aaactggagg 

68 gcctgtaacg tgttgcaatc ctctcatttg gaagactatc ccttcgatgc agaatattga 



60 
111 



159 



207 



255 



303 



353 



413 
473 
533 
593 
653 
713 
773 



(/JUL jCfe^H 
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RAW SEQUENCE LISTING DATE: 03/29/2004 

PATENT APPLICATION: US/10/653 , 681 TIME: 13:18:10 

Input Set : A:\U 014798-3.ST25.txt 
Output Set: N:\CRF4\03292004\J653681.raw 

70 ggttgaatct cctggtgaga ttatacagga gattctcttt cttcgctgaa gtgtgactac 

72 ctccactcat gtcccatttt agccaagctt atttaagatc acagtgaact tagtcctgtt 

74 atagacgaga atcgaggtgc tgttttagac atttatttct gtatgttcaa ctaggatcag 

76 aatatcacag aaaagcatgg cttgaataag gaaatgacaa ttttttccac ttatctgatc 

78 agaacaaatg tttattaagc atcagaaact ctgccaacac tgaggatgta aagatcaata 



80 
83 
84 
85 
86 



aaaaaaataa taatcat 



89 
91 



SEQ ID NO: 
LENGTH: 88 
TYPE: PRT 

ORGANISM/ APT I VI C1AL 
FEATURE: 

OTHER INFORMA I I li/f INK OWN 
SEQUENCE: 2 

Thr Phe Val Glu Leu Ser Thr Lys Ala Lys 
5 10 




<210> 
<211> 
<212> 
<213> 
<220> 
<223> 

<4 00> _ 

93 Met Ala Thr Phe Val Glu Leu Ser Thr Lys Ala Lys Met Pro He Val 

94 1 5 10 15 

97 Gly Leu Gly Thr Trp Lys Ser Pro Leu Gly Lys Val Lys Glu Ala Val 

98 ' " 20 2 5 30 

101 Lys Val Ala He Asp Ala Gly Tyr-Arg His He Asp Cys Ala Tyr Val 

102 35 40 45 

105 Tyr Gin Asn Glu His Glu Val Gly Glu Ala He Gin Glu Lys He Gin 

106 50 55 60 

109 Glu Lys Ala Val Lys Arg Glu Asp Leu Phe He Val Ser Lys Leu Trp 

110 65 J 70 75 80 

113 Pro Thr Ser Arg Ser Arg Ser Ser 

114 85 

117 <210> SEQ ID NO: 3 

118 <211> LENGTH: 1279 

119 <212> TYPE: DNA 

120 <213> ORGANISMS ARTIFICIAL 

122 <220> FEATURE: 

123 <223> OTHER INFORMATION UNKNOWN 

126 <220> FEATURE: 

127 <221> NAME /KEY : CDS 

128 <222> LOCATION: (70).. (804) 

130 <400> SEQUENCE: 3 

131 caaaaacagc aacagaaagc aggacgtgag acttctacct gctcactcag aatcatttct 
133, gcaccaacc atg gcc acg ttt gtg gag etc agt acc aaa gec aag atg ccc 

134 Met Ala Thr Phe Val Glu Leu Ser Thr Lys Ala Lys Met Pro 

135 15 10 

137 att gtg ggc ctg ggc act tgg aag tct cct etc ggc aaa gtg aaa gaa 

138 He Val Gly Leu Gly Thr Trp Lys Ser Pro Leu Gly Lys Val Lys Glu 

139 15 20 25 30 

141 gca gtg aag gtg gcc att gat gca gga tat egg cac att gac tgt gcc 

142 Ala Val Lys Val Ala He Asp Ala Gly Tyr Arg His He Asp Cys Ala 

143 35 40 45 

145 tat gtc tat cag aat gaa cat gaa gtg ggg gaa gcc ate caa gag aag 

146 Tyr Val Tyr Gin Asn Glu His Glu Val Gly Glu Ala He Gin Glu Lys 

147 50 55 60 

149 ate caa gag aag get gtg aag egg gag gac ctg ttc ate gtc age aag 




833 
893 
953 
1013 
1073. 
1090 



60 
111 



159 



207 



255 



303 
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RAW SEQUENCE LISTING DATE: 03/2 9/2004 

PATENT APPLICATION: US/10/653 , 681 TIME: 13:18:10 



Input Set : A:\U 014798-3.ST25.txt 
Output Set: N:\CRF4\03292004\J653681.raw 



150 


He 


Gin 


Glu 


Lys 


Ala 


Val 


Lys 


Arg 


Glu 


Asp 


Leu 


Phe 


He 


Val 


Ser 


Lys 




151 






65 










70 










75 










153 


ttg 


tgg 


ccc 


act 


ttc 


ttt 


gag 


aga 


ccc 


ctt 


gtg 


agg 


aaa 


gec 


ttt 


gag 


351 


154 


Leu 


Trp 


Pro 


Thr 


Phe 


Phe 


Glu 


Arg 


Pro 


Leu 


Val 


Arg 


Lys 


Ala 


Phe 


Glu 




155 




80 










85 










90 












157 


aag 


acc 


etc 


aag 


gac 


ctg 


aag 


ctg 


age 


tat 


ctg 


gac 


gtc 


tat 


ctt 


att 


399 


158 


Lys 


Thr 


Leu 


Lys 


Asp 


Leu 


Lys 


Leu 


Ser 


Tyr 


Leu 


Asp 


Val 


Tyr 


Leu 


He 




159 


95 










100 










105 










110 




161 


cac 


tgg 


cca 


cag 


gga 


ttc 


aag 


tct 


ggg 


gat 


gac 


ctt 


ttc 


ccc 


aaa 


gat 


447 


162 


His 


Trp 


Pro 


Gin 


Gly 


Phe 


Lys 


Ser 


Gly 


Asp 


Asp 


Leu 


Phe 


Pro 


Lys 


Asp 




163 










115 










120 










125 






165 


gat 


aaa 


ggt 


aat 


gec 


ate 


ggt 


gga 


aaa 


gca 


acg 


ttc 


ttg 


gat 


gee 


tgg 


495 


166 


Asp 


Lys 


Gly 


Asn 


Ala 


He 


Gly 


Gly 


Lys 


Ala 


Thr 


Phe 


Leu 


Asp 


Ala 


Trp 




167 








130 










135 










140 








169 


gag 


gec 


atg 


gag 


gag 


ctg 


gtg 


gat 


gag 


ggg 


ctg 


gtg 


aaa 


gee 


ctt 


ggg 


543 


170 


Glu 


Ala 


Met 


Glu 


Glu 


Leu 


Val 


Asp 


Glu 


Gly 


Leu 


Val 


Lys 


Ala 


Leu 


Gly 




171 






145 










150 










155 










173 


at c 

ZJ ~~ ~ 


t cc 


aat 


ttc 


age 


cac 


ttc 


cao 


ate 


aaa 


aag 


etc 


ttg 


aac 


a a a 


cct 


591 


174 


Val 


Ser 


Asn 


Phe 


Ser 


His 


Phe 


Gin 


He 


Glu 


Lys 


Leu 


Leu 


Asn 


Lys 


Pro 




175 




160 










165 










170 












177 


gga 


ctg 


aaa 


tat 


aaa 


cca 


gtg 


act 


aac 


cag 


gtt 


gag 


tgt 


cac 


cca 


tac 


639 


178 


Gly 


Leu 


Lys 


Tyr 


Lys 


Pro 


Val 


Thr 


Asn 


Gin 


Val 


Glu 


Cys 


His 


Pro 


Tyr 




179 


175 










180 










185 










190 




181 


etc 


acg 


cag 


gag 


aaa 


ctg 


ate 


cag 


tac 


tgc 


cac 


tec 


aag 


ggc 


ate 


acc 


687 


182 


Leu 


Thr 


Gin 


Glu 


Lys 


Leu 


He 


Gin 


Tyr 


Cys 


His 


Ser 


Lys 


Gly 


He 


Thr 




183 










195 










200 










205 






185 


gtt 


acg 


gec 


tac 


age 


ccc 


ctg 


ggc 


tct 


ccg 


gat 


aga 


cct 


tgg 


gec 


aag 


735 


186 


Val 


Thr 


Ala 


Tyr 


Ser 


Pro 


Leu 


Gly 


Ser 


Pro 


Asp 


Arg 


Pro 


Trp 


Ala 


Lys 




187 








210 










215 










220 








189 


cca 


gaa 


gac 


cct 


tec 


ctg 


ctg 


gag 


gat 


ccc 


aag 


att 


aag 


gag 


att 


get 


783 


190 


Pro 


Glu 


Asp 


Pro 


Ser 


Leu 


Leu 


Glu 


Asp 


Pro 


Lys 


He 


Lys 


Glu 


He 


Ala 




191 






225 










230 










235 










193 


gca 


aag 


cac 


tec 


cca 


agt 


ctg 


tgacaccagc acgcattgtt gagaacattc 


834 


194 


Ala 


Lys 


His 


Ser 


Pro 


Ser 


Leu 






















195 




240 










245 






















197 


aggtctttga < 


ctttaaattg agtgatgagg 


agatggcaac 


catactcagc • 


ttcaacagaa 


894 


199 


actggagggc • 


ctgtaacgtg ttgeaatect 


ctcatttgga 


agactatccc 


ttcgatgcag 


954 


201 


aatattgagg 


ttgaatctcc tggtgagatt 


atacaggaga 


ttctctttct ' 


tegctgaagt 


1014 


203 


gtgactacct < 


ccactcatgt cccattttag 


ccaagcttat 


ttaagatcac . 


agtgaactta 


1074 


205 


gtcctgttat . 


agacgagaat cgaggtgctg 


ttttagacat' 


ttatttctgt « 


atgttcaact 


1134 


207 


aggatcagaa 


tatcacagaa aagcatggct 


tgaataagga 


aatgacaatt 


ttttccactt 


1194 


209 


atctgatcag . 


aacaaatgtt tattaagcat 


cagaaactct 


gccaacactg « 


aggatgtaaa 


1254 


211 


gatcaataaa . 


aaaaataata atcat 




















1279 


214 


<210> SEQ ID NO: 


4 


























215 


<211> LENGTH: 245 


























216 


<212> TYPE: 


PRT 




























217 


<213> ORGANISM:/ 


ARTIFICIAL^ 






















219 


<220> FEATURE: ^ 




























220 


<223> OTHER 


INFORMATION; 


/ UNKNOWN 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/653 , 681 



DATE: 03/29/2004 
TIME: 13:18:10 



Input Set : 
Output Set: 



A:\U 014798-3.ST25.txt 
N:\CRF4\03292004\J653681.raw 



222 <400> SEQUENCE: 4 

224 Met Ala Thr Phe Val Glu Leu Ser Thr Lys Ala Lys Met Pro He Val 

225 1 5 10 15 

228 Gly Leu Gly Thr Trp Lys Ser Pro Leu Gly Lys Val Lys Glu Ala Val 

229 20 25 30 

232 Lys Val Ala He Asp Ala Gly Tyr Arg His He Asp Cys Ala Tyr Val 

233 35 4 0 4 5 

236 Tyr Gin Asn Glu His Glu Val Gly Glu Ala He Gin Glu Lys He Gin 

237 50 55 60 

240 Glu Lys Ala Val Lys Arg Glu Asp Leu Phe He Val Ser Lys Leu Trp 

241 65 70 75 80 

244 Pro Thr Phe Phe Glu Arg Pro Leu Val Arg Lys Ala Phe Glu Lys Thr 

245 85 90 95 

248 Leu Lys Asp Leu Lys Leu Ser Tyr Leu Asp Val Tyr Leu He His Trp 

249 100 105 110 

252 Pro Gin Gly Phe Lys Ser Gly Asp Asp, Leu Phe Pro Lys Asp Asp Lys 

253 115 120 125 

256 Gly Asn Ala He Gly Gly Lys Ala Thr Phe Leu Asp Ala Trp Glu Ala 

257 130 135 . 140 

260 Met Glu Glu Leu Val Asp Glu Gly Leu Val Lys Ala Leu Gly Val Ser 

261 145 150 155 160 

264 Asn Phe Ser His Phe Gin He Glu Lys Leu Leu Asn Lys Pro Gly Leu 

265 165 170 175 

268 Lys Tyr Lys Pro Val Thr Asn Gin Val Glu Cys His Pro Tyr Leu Thr 

269 180 185 190 

272 Gin Glu Lys . Leu He Gin Tyr Cys His Ser Lys Gly He Thr Val Thr 

273 195 200 205 

276 Ala Tyr Ser Pro Leu Gly Ser Pro Asp Arg Pro Trp Ala Lys Pro Glu 

277 210 215 220 

280 Asp Pro Ser Leu Leu Glu Asp Pro Lys He Lys Glu He Ala Ala Lys 

281 225 230 235 240 

284 His Ser Pro Ser Leu 

285 245 
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RAW SEQUENCE LISTING ERROR SUMMARY DATE: 03/2 9/2004 

PATENT APPLICATION: US/10/653 , 681 . TIME: 13:18:11 

Input Set : A:\U 014798-3.ST25.txt 
Output Set: N:\CRF4\03292004\J653681.raw 

Invalid <2 13> Respon se : 

Use of "Artificial" only as "<213> Organism" response is incomplete, 

per 1.823(b) of New Sequence Rules. Valid response is Artificial Sequence. 

Seq#:l,2,3,4 
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VERIFICATION SUMMARY DATE: 03/2 9/2004 

PATENT APPLICATION: US/10/653 , 681 TIME: 13:18:11 

Input Set : A:\U 014798-3.ST25.txt 
Output Set: N:\CRF4\03292004\J653681.raw 
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